Concentration inequalities for sampling without replacement, with applications to post-election audits

July 29, 2020 (Zoom - See email or contact organizers for link)

Note that we (the organizers) forgot to record this talk, but the link above is a recording of Aaditya giving the same talk at Waterloo.

Abstract: Many practical tasks involve sampling sequentially without replacement from a finite population in order to estimate some parameter, like a mean. We discuss how to derive powerful (new) concentration inequalities for this setting using martingale techniques, and apply it to auditing elections (see below).

This is joint work with my PhD student, Ian Waudby-Smith. An early preprint is available here: https://arxiv.org/abs/2006.04347.

More details: When determining the outcome of an election, electronic voting machines are often employed for their tabulation speed and cost-effectiveness. Unlike paper ballots, these machines are vulnerable to software bugs and fraudulent tampering. Post-election audits provide assurance that announced electoral outcomes are consistent with paper ballots or voter-verifiable records. We propose an approach to election auditing based on confidence sequences (VACSINE)—these are visualizable sequences of confidence sets for the total number of votes cast for each candidate that adaptively shrink to zero width. These confidence sequences have uniform coverage from the beginning of an audit to the point of an exhaustive recount, but their main advantage is that their error guarantee is immune to continuous monitoring and early stopping, providing valid inference at any auditor-chosen, data-dependent stopping time. We develop VACSINEs for various types of elections including plurality, approval, ranked-choice, and score voting protocols.

Bio: Aaditya Ramdas is an assistant professor at Carnegie Mellon University, joint in the Departments of Statistics and Machine Learning. His work is supported by an NSF Career Award and an Adobe Faculty Research Award. Aaditya's main theoretical and algorithmic research interests include selective and simultaneous inference, sequential uncertainty quantification and distribution-free black-box predictive inference. His areas of applied interest include neuroscience, genetics and voting.